PerfMiner: Cluster-Wide Collection, Storage and Presentation of Application Level Hardware Performance Data

نویسندگان

Philip Mucci

Daniel Ahlin

Johan Danielsson

Per Ekman

Lars Malinowski

چکیده

We present PerfMiner, a system for the transparent collection, storage and presentation of thread-level hardware performance data across an entire cluster. Every sub-process/thread spawned by the user through the batch system is measured with near zero overhead and no dilation of run-time. Performance metrics are collected at the thread level using tool built on top of the Performance Application Programming Interface (PAPI). As the hardware counters are virtualized by the OS, the resulting counts are largely unaffected by other kernel or user processes. PerfMiner correlates this performance data with metadata from the batch system and places it in a database. Through a command line and web interface, the user can make queries to the database to report information on everything from overall workload characterization and system utilization to the performance of a single thread in a specific application. This is in contrast to other monitoring systems that report aggregate system-wide metrics sampled over a period of time. In this paper, we describe our implementation of PerfMiner as well as present some results from the test deployment of PerfMiner across three different clusters at the Center for Parallel Computers at The Royal Institute of Technology in Stockholm, Sweden.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

scc: cluster storage provisioning informed by application characteristics and SLAs

Storage for cluster applications is typically provisioned based on rough, qualitative characterizations of applications. Moreover, configurations are often selected based on rules of thumb and are usually homogeneous across a deployment; to handle increased load, the application is simply scaled out across additional machines and storage of the same type. As deployments grow larger and storage ...

متن کامل

Distributed Paged Hash Tables

In this paper we present the design and implementation of DPH, a storage layer for cluster environments. DPH is a Distributed Data Structure (DDS) based on the distribution of a paged hash table. It combines main memory with file system resources across the cluster in order to implement a distributed dictionary that can be used for the storage of very large data sets with key based addressing t...

متن کامل

JavaSymphony: A System for Development of Locality-Oriented Distributed and Parallel Java Applications

Most Java-based systems that support portable parallel and distributed computing either require the programmer to deal with intricate low-level details of Java which can be a tedious, timeconsuming and error-prone task, or prevent the programmer from controlling locality of data. In this paper we describe JavaSymphony, a programming paradigm for distributed and parallel computing that provides ...

متن کامل

Informed Provisioning of Storage for Cluster Applications

Today, application providers can choose from a range of storage choices to provision their infrastructure for cluster-based applications . Each storage technology presents a different point in a complex tradeoff space of cost, capacity, and performance . To help application providers choose from these alternatives, we developed scc [1] to automate the selection of cluster storage configurations...

متن کامل

A Storage Architecture for Data - Intensive Computing by Jeffrey Shafer A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE Doctor of Philosophy

A Storage Architecture for Data-Intensive Computing by Jeffrey Shafer The assimilation of computing into our daily lives is enabling the generation of data at unprecedented rates. In 2008, IDC estimated that the “digital universe” contained 486 exabytes of data [9]. The computing industry is being challenged to develop methods for the cost-effective processing of data at these large scales. The...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2005

PerfMiner: Cluster-Wide Collection, Storage and Presentation of Application Level Hardware Performance Data

نویسندگان

چکیده

منابع مشابه

scc: cluster storage provisioning informed by application characteristics and SLAs

Distributed Paged Hash Tables

JavaSymphony: A System for Development of Locality-Oriented Distributed and Parallel Java Applications

Informed Provisioning of Storage for Cluster Applications

A Storage Architecture for Data - Intensive Computing by Jeffrey Shafer A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE Doctor of Philosophy

عنوان ژورنال:

اشتراک گذاری